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Abstract: A Boolean function is symmetric if it is invariant under all permutations of its arguments; it is 
quasi- symmetric if it is symmetric with respect to the arguments on which it actually depends. We present a 
test that accepts every quasi-symmetric function and, except with an error probability at most (5 > 0, rejects 
every function that differs from every quasi-symmetric function on at least a fraction e > of the inputs. For 
a function of n arguments, the test probes the function at 0((n/e) log(n/(5)) inputs. Our quasi-symmetry 
test acquires information concerning the arguments on which the function actually depends. To do this, it 
employs a generalization of the property testing paradigm that we call attribute estimation. Like property 
testing, attribute estimation uses random sampling to obtain results that have only "one-sided" errors and 
that are close to accurate with high probability. 



1. Introduction 

Suppose that we are given a Boolean function / : {0, 1}" {0, 1} of n arguments, and that we wish 
to determine whether or not it is symmetric. (Such a function / is symmetric if it is invariant under all 
n! permutations of its arguments: /(xi, . . . , a;„) ~ f{x^(^i-^,...,x^(^n)) ^ot all {xi,...,Xn) G {0,1}" and all 
bijections tt : {1, . . . , n} {1, . . . , n}. Equivalently, / is symmetric if it is constant on sets of settings of 
its arguments that have equal Hamming weight: f{x) = f{y) if |a;| = \y\, where the Hamming weight \x\ of 
X G {0, 1}" is the number X^i<j<„ Xi of Is among xi, . . . ,Xn-) We seek to do this by querying the value of 
the function at various points (settings of its argument values), and we endeavor to minimize the number of 
queries that we make. 

It is not hard to sec that in the worst case, we may need to make 2" — 2 queries. For if / is the "all 
Os" constant function, wc arc bound to find that it is symmetric, but we dare not announce this conclusion 
before querying all 2" - 2 points other than "all Os" (0, ... ,0) G {0, 1}" and "all Is" (1, . . . , 1) G {0, 1}". 
(For if (xi, . . . , Xn) is such a point not queried, then the function g that assumes the value 1 at and only at 
the point {xi, . . . ,Xn) would be a non-symmetric function that assumes the same value as / at all queried 
points.) 

This situation may be summarized by saying that a "witness" for symmetry (a set of points at which the 
values of a function ensure its symmetry) must contain 2" — 2 points. A witness for non-symmetry, however, 
may consist of just two points, x and y, such that |a;| = \y\, but for which f{x) ^ /(y)- This suggests that 
we may test for symmetry more efficiently if we are willing to allow a small probability of a "false positive" 
(when we declare a function to be symmetric when in fact it is not). 

This suggestion leads us to the paradigm of "property testing" . To introduce this notion to the current 
context, we shall need some definitions. If / and g are Boolean functions of n arguments, the distance 
A(/, g) between them is the fraction of their truth-table entries on which they differ. This fraction may be 
written as A(/, g) = | / © g|/2", where / ® g denotes the "cxclusive-or" or "sum modulo 2" of / and g, and 
the Hamming weight \f\ of a Boolean function / is the number '}2xe{o i}" /(^) "-"^ truth-table. This 

is a proper metric on the set B„ of Boolean functions of n arguments (that is, it is non-negative, vanishes 
only when the functions are equal, is symmetric and satisfies the triangle inequality). Furthermore, if we 
extend it by agreeing that functions of different numbers of arguments are at "infinite distance" , it remains 
a metric, now defined on the set B = U„>o °^ Boolean functions. If T and Q are sets of Boolean 
functions, we define A{f,Q) = miiig^g A{f,g) and A{J^,g) = min/gjF A(/, ^). If A(/, ^) > e, we shall say 
that / is e-far from Q. 

By a test for some property of Boolean functions (reified as a set G of Boolean functions) , we shall mean 
a randomized algorithm that takes a Boolean function / and two parameters £ > and ^ > 0, queries the 
value of / at a finite sequence of points (where the number of queries, and choice of later points may depend 
on the outcomes of earlier queries), and announces an answer yes or no, where (1) if / has the property in 

question (that is. / G Q), then the algorithm answers yes, and (2) if / is £-far from the set of functions with 
the property in question (that is, A(/, G) > e), then the algorithm answers no unless an event of probability 
at most 6 has occurs. (Note that "probability" here refers to the randomization of the algorithm; the bound 
6 applies uniformly to all / that are £-far from G.) 

Property testing, in the sense used here, was introduced by Rubinfeld and Sudan [R] and further 
developed by Goldreich, Goldwasser and Ron [G3]. The definition has many variants and has been applied 
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to properties of many types of objects. A central example involving Boolean functions is testing monotonicity 
(see Goldreich, Goldwasser, Lehman and Ron (and, in the later version, Samorodnitsky) [Gl, G2]). 

In Section 2, we shall give a test for symmetry that makes at most 0((l/e) log(l/(5)) queries. Although 
the test is randomized, this bound on the number of queries is uniform (not merely a bound on expectation, 
nor merely one that holds with high probability), and it is independent of the number of arguments. When 
the algorithm returns no, it also provides a witness to the non-symmetry of /. (Of course, when yes is 
returned, it may be a false positive, and in any case a witness to symmetry would be too large establish 
within the stated number of queries.) 

We shall say that a Boolean function / : {0, 1}" {0, 1} of n arguments is quasi- symmetric if it is a 
symmetric function of those of its arguments that it actually depends on. (We say that / depends on its i-th 
argument if there arc Boolean values xi,. . . , Xj+i, . . . ,Xn such that f{xi, . . . , Xi-i,0, Xi+i, . . . , Xn) 

/(Xi, . . . l,Xi+i, . . .,Xn).) 

In Section 4, we shall give a test for quasi-symmetry that makes at most 0((n/e) log(n/(5)) queries. 
In this case, the number of queries depends on the number of n of arguments as well as on s and 6. This 
happens because the test begins by attempting to determine the set J^{f) C {1, . . . ,n} of arguments on 
which the function / actually depends, and this set could be as large as the full set of n arguments. 

Upon considering the subproblem of determining the set Jif) from the function /, we see that it 
presents many of the same characteristics as testing for symmetry: a witness for the fact that / depends on 
its z-th arguments can comprise just two points (as exemplified in the definition), but a witness for the fact 
that / does not depend on some particular argument cannot be smaller than all 2" points. Thus, instead 
of determining J7'(/) exactly, we shall introduce a notion of "estimating" such an attribute that is exactly 
analogous to the notion of "testing" a property. 

A function T> : B ^ C defined on the set of all Boolean functions, and taking values in a partially ordered 
set C, will be called an attribute of Boolean functions. An estimate for the attribute P is a randomized 
algorithm that takes a Boolean function / and two parameters e > and S > 0, queries the value of / 
at a finite sequence of points (where the number of queries, and choice of later points may depend on the 
outcomes of earlier queries), and announces an output D G C, where (1) 2?(/) > D, and (2) / is e-far from 
the set 'D~^{D) of functions for which T> takes on the value D only if an event of probability at most 6 
occurs. If we take C = {yes, no} with yes < no, then estimating such an attribute reduces to testing the 
corresponding property T>~^{yes). 

In Section 3 we shall give an estimate for the set >/(/) of arguments that / depends on (with 
the codomain, the power set of {!,..., n}, ordered in the usual way by inclusion) that makes at most 
0((n/e) log(n/5)) queries. The algorithm will also provide a witness for the fact that the value it returns is 
a lower bound for J{f). This estimate will be used in Section 4 as the basis for our quasi-symmetry test, 
and the witness that it provides will be used to construct a witness for non-quasi-symmetry when that is 
detected. 

The problem of testing whether a Boolean function depend on a small subset of arguments has been 

attacked by Parnas, Ron and Samordnitsky [PI. P2], and later by Fischer, Kindler, Ron, Safra and Samorod- 
nitsky [F]. Their work, however, lies entirely within the framework of property testing: their algorithms test 
whether the cardinality #i/(/) is at most k (k a constant), without giving further information about the 
set J{f) (and they do this with a number of queries that is independent of n). Our quasi-symmetry test. 
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however, requires more information about J{f) than merely its cardinality, and this requirement led us to 
our formulation of the notion of attribute estimation. 

Despite its naturalness and its analogy to property testing, attribute estimation does not appear to have 
been described in the previous literature. Wc hope, however, that it will find other applications, and indeed 
that the notion of attribute estimation, both as used here and as extended to other domains such as graphs, 
will prove a fruitful contribution to the theory of statistical algorithms. 



2. Testing Symmetry 

In this section, we shall present our symmetry test and analyze its performance. We shall begin by 
describing what we shall call a "basic step" of the algorithm. 

Symmetry Test — Basic Step: (0) Given Boolean function / : {0, 1}" — > {0, 1} of n arguments, return yes 
if < n < 1. (1) Choose point x = {xi, . . . ,Xn) at random, with all 2" — 2 points other than (0, . . . ,0) 
and (1, . . . , 1) being equally likely. (2) Choose point y = (yi, . . . , ?/„) at random, with all (|"|) — 1 points 
other than x, but having the same Hamming weight as x, being equally likely. (Since n > 2 and x ^ 
{(0, . . . , 0), (1, . . . , 1)}, there is at least one possible choice for y.) (3) Query f{x) and f{y). If f{x) = f{y), 
then return yes, else return no. 

Clearly, if / is symmetric, then this procedure returns yes. Let <S denote the set of all symmetric Boolean 
functions. We shall see that if / is not symmetric, then the procedure returns no with probability at least 
A{f,S). To see this, let A C {0,1}" be a set of points of minimum cardinality such that complementing 
the value of / at just those points in A yields a symmetric function. This minimum cardinality is =f^A = 
A(/,5)2". For any x £ {0,1}", let C {0,1}" denote the set of points y such that \y\ = |x| but 
f(y) ^ f[x). The probability that the basic step chooses x £ A is A(/,5)2"/(2" - 2) > A{f,S). Given 
that x G A, the probability that the basic step chooses y G B^ is at least |(|"|)/ ((|"|) — > ^ (since if 
#Bx were less than we could replace the points of A having Hamming weight |a;| by those of B^, 

reducing the cardinality of A and contradicting the definition of A). The procedure returns no if a; S A 
and y E Bx, which occurs with probability at least ^A{f,S), and also if y <E A and x E By, which occurs 
disjointly with a probability that is also at least ^ A(/, S) (since the joint distribution of x and y is invariant 
under the exchange of x and y). This completes the proof that the basic step returns no with probability at 
least A(/,5). 

Now we present our complete symmetry test. 

Symmetry Test: (0) Given Boolean function / : {0, 1}" {0, 1} of n arguments, and real numbers < e < 1 
and 0<(5<1, ifO<n<l, then return yes. (1) Otherwise compute 



k = 



1, 1 



(2) Perform the basic step until it returns no, or until it has returned yes k times. (3) If any performance 
of the basic step returned no, then return no; if all k performances returned yes, then return yes. 

Clearly, if / is symmetric, then this procedure returns yes. We shall see that if / is £-far from the 
symmetric functions, then the procedure returns no with probability at least 1 — ^. If / is £-far from 
the symmetric functions, then each performance of the basic step returns yes with probability at most 
1 — A(/, iS) < 1 — £. The symmetry test returns yes only if k performances of the basic step return yes, 
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and this occurs with probability at most (1 — e)'' < e"^*^ < 6 (where we have used the inequality 1 + x < 

and the definition of k). This completes the proof that, if if / is £-far from the symmetric functions, then 
the procedure returns no with probability at least 1 — (5. 

Wc now have the following theorem. 



We conclude this section by observing that when the basic step returns no, the two points it has queried 
witness the non-symmetry of the function, and when the complete symmetry test returns no, such a witness 
is provided by the final performance of the basic step. 

3. Estimating Dependence 

In this section we shall present an estimate, as defined in the introduction, for the set J{f) of arguments 
that / depends on. We begin by describing a constancy test analogous to the symmetry test presented in 

the preceding section. 

Constancy Test — Basic Step: (0) Given Boolean hmction / : {0, 1}" {0, 1} of n arguments, return yes 
if n = 0. (1) Choose point x = (xi, . . . ,a;„) at random, with all 2" points being equally likely. (2) Choose 
point y = (yi, . . . ,yn) at random, with all 2" — 1 points other than x being equally likely. (3) Query f(x) 
and f{y). If f{x) = f{y), then return yes, else return no. 

Clearly, if / is constant, then this procedure returns yes. Let C denote the set of all constant Boolean 
functions. We shall see that if / is not constant, then the procedure returns no with probability at least 
A(/, C). To see this, let A C {0, 1}" be a set of points of minimum cardinality such that complementing the 
value of / at just those points in A yields a constant function. This minimum cardinality is =ffA = A(/, C) 2". 
For any x G {0,1}", let C {0,1}" denote the set of points y such that f{y) ^ fix). The probability 
that the basic step chooses x £ A is A{f,C). Given that x € A, the probability that the basic step chooses 
y G Bx is at least 2"~^/(2" — 1) > 5 (since if #5^ were less than 2"~^, we could replace the points of A 
by those of B^, reducing the cardinality of A and contradicting the definition of A). The procedure returns 
no if a; € A and y £ B^, which occurs with probability at least ^A(/, C), and also ii y € A and x £ By, 
which occurs disjointly with a probability that is at least iA(/, C) (since the joint distribution of x and y 
is invariant under the exchange of x and y). This completes the proof that the basic step returns no with 
probability at least A(/, C). 

Now we present our complete constancy test. 

Constancy Test: (0) Given Boolean function / : {0, 1}" — > {0, 1} of n arguments, and real numbers < £ < 1 
and < 5 < 1, if n = 0, then return yes. (1) Otherwise compute 



(2) Perform the basic step until it returns no, or until it has returned yes k times. (3) If any performance 
of the basic step returned no, then return no; if all k performances returned yes, then return yes. 



Theorem 2.1: There is a test for symmetry that makes at most 0((l/£) log(l/5)) queries. 
Proof: The number of queries made is at most 




□ 
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Clearly, if / is constant, then this procedure returns yes. We shall see that if / is e-far from the constant 

functions, then the procedure returns no with probability at least 1 — 5. If / is e-far from the constant 
functions, then each performance of the basic step returns yes with probability at most 1 — A(/, C) < 1 — e. 
The complete test returns yes only if k performances of the basic step return yes, and this occurs with 
probability at most (1 — e)*^ < e~^*^ < 5 (where we have used the inequality 1 + x <e^ and the definition of 
k). This completes the proof that, if if / is £-far from the constant functions, then the procedure returns no 
with probability at least 1 — ^. 

We now have the following lemma. 

Lemma 3.1: There is a test for constancy that makes at most 0((l/£) log(l/5)) queries. 

Proof: The number of queries made is at most 



We observe that when the basic step returns no, the two points it has queried witness the non-constancy 
of the function, and when the complete constancy test returns no, such a witness is provided by the final 
performance of the basic step. 

The next component we shall need is a procedure that takes a pair of points that witness the non- 
constancy of a function and returns a particular argument on which the function depends. We shall call this 
operation a "dependency search" . 

Dependency Search: (0) Given Boolean function / : {0, 1}" — > {0, 1} of n arguments, and a pair of points 
x,y G {0, 1}" such that f{x) ^ f{y), set x' := x and y' := y. (1) If \x' ® y'\ = 1, then return the unique 
i G {1, . . . ,n} such that a;- ^ y^. (2) Otherwise, let z G {0, 1}" be such that \x' ® z\ and \y' ® z\ are each at 
most \\x' ® 2/'|/2l (so that z is about half-way between x' and y'). (3) Query f{z). If f{x') ^ f{z), then set 
y' := z, else set x' := z. (4) Go back to step (1). 

Lemma 3.2: There is a procedure for dependency search that makes O(logn) queries. 

Proof: In the procedure given above, the quantity \x' y'\ is initially at most n. The procedure reduces this 
quantity by multiplying it by a factor at most 2/3 whenever it makes a query, and stops when this quantity 
reaches 1. Thus it makes at most logg/j n queries. □ 

We observe that the final values of x' and y in this dependency search provide a witness that / actually 
depends on the i-th argument. 

We now present our complete dependency estimate. 

Dependency Estimate: (0) Given a Boolean function / : {0, 1}" — > {0, 1} of n arguments, and real numbers 
< e < 1 and < ^ < 1, let ^' = 5/n and set J' := 0. (1) Set xji = {xj : j € J'} to random Boolean values, 
with all 2#'^ assignments being equally likely, and let /' : {0,1}""'^'^ {0,1} be the Boolean function 
obtained from / by substituting the values xji for the arguments in J'. (2) Perform a constancy test on the 
function /' with parameters e and 6'. (3) If this test returns yes, then return J'. (4) Otherwise, perform a 
dependency search on the witness returned by the constancy test, and adjoin the argument j returned by 
the dependency search to J': J' := J' U {j}. (5) Go back to step (1). 

If J denotes the set returned by this procedure, it is clear that J C J{f). Indeed, this fact is witnessed 
by the set of pairs of points x' ,y' that terminate the dependency searches that found the arguments in J. 
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Let Bj be the set of Boolean functions that actually depend only on the arguments in J. We shall show 

that that if / is £-far from any function in Bj, then an event of probability at most S has occurred. If / 
is £-far from any function in Bj, we have A(/, Z?,/) > e. For each performance of the constancy test, we 
have J' C J; this implies A(/, Sj') > A(/, Sj), and thus we have A(/, Sj) > e. One of these at most n 
constancy tests must return yes, and each test returns yes with probability at most 5' = 5/n. Thus the 
probability that any of these tests returns yes is at most nd' = 5. Finally, the number of queries made is at 
most 

n (o log + O(logn)^ = " ('^ ( J + 0{logn)^ = O log |) . 

We now have the following theorem. 
Theorem 3.3: There is an estimate for the dependency set that makes at most 0{{n/e) log(n/(5)) queries. 



4. Testing Quasi-Symmetry 

In this section, we present our quasi-symmetry test. Again we begin by describing a basic step. 

Quasi-Symmetry Test — Basic Step: (0) Given a Boolean function / : {0, 1}" {0, 1} of n arguments, a real 
number < e < 1, and a set J of arguments, set / = {1, . . . , n} \ J. (1) Set xi = {xi : i e /} to random 
Boolean values, with all assignments being equally likely, and let /' : {0, 1}^"' {0, 1} be the Boolean 
function obtained from / by substituting the values xi for the arguments in /. (2) Perform a symmetry test 
on /', with parameters e and 1/2, and return the value returned by this symmetry test as the value of the 
basic step. 

Suppose that the basic step is performed on a function / that actually depends on all the arguments 
in J. If / is quasi-symmetric, then because / actually depends on all the arguments in J and /' depends 
only on these arguments, /' is symmetric and the basic step will return yes. Let g( € Bj be a function that 
minimizes A{f,g). This minimum distance is A{f,g) — A(/, Bj). Now suppose further that A{f,g) < e. 
Then we shall show that if / is 4e-far from the set Q of quasi-symmetric functions, the basic step returns 
no with probability at least 1/4. By the triangle inequality we have A{f,g) + A{g, Q) > A(/, Q) > Ae, 
and thus A{g,Q) > 3e. Let g' be obtained from g by substituting the random values xj for the arguments 
in I. Of course g' is independent of these random values, since g does not depend on the arguments in 
/. Furthermore, A{g',S) > A{g', Q) > A{g, Q) > 3e, since <S C Q and g does not depend on any of its 
arguments that are not arguments of g'. On the other hand, A{g' , /') is a random variable, since /' may 
depend on the random values xj. The expected value of A{g' , f) is clearly A(g, /) = A{f,g) < e, so 
with probability at least 1/2 we have A{f',g') < 2£, by Markov's inequality. When this happens, we have 
^{g'j /') + ^(/'; '5) > A{g', S) > 3e, again by the triangle inequality, and thus A(/', S) > e. This condition 
ensures that the symmetry test returns no with probability at least 1/2. This completes the proof that when 
/ is 4£-far from the set Q of quasi-symmetric functions, then the basic step returns no with probability at 
least 1/4. 

Quasi-Symmetry Test: (0) Given a Boolean function / : {0, 1}" {0, 1} of n arguments, and real numbers 
< e < 1 and < 5 < 1, let e' = e/A and 5' = 5/2, and compute 



k = 



log. 
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(1) Perform a dependency estimate on /, with parameters e' and 5' , and let J be the value returned by 

that procedure. (2) Perform the basic step with parameters /, e' , and J. until it returns no, or until it 
has returned yes k times. (3) If any performance of the basic step returned no, then return no; if all k 
performances returned yes, then return yes. 

Suppose first that / is quasi-symmetric. The function / actually depends on all the arguments in the 
set J found in step (1). Thus each performance of the basic step returns yes, and so the quasi-symmetry test 
returns yes after k such performances. Suppose on the other hand that / is £-far from any quasi-symmetric 
function. Then, except with probability at most 5', A(^f,Bj) < e' for the set J found by step (1). Thus 

each performance of the basic step returns no with probability at least 1/4, and therefore returns yes with 
probability at most 3/4. The probability that all k performances of the basic step return yes is thus at 
most (3/4)*^ < 6' . Thus if / is e-far from the quasi-symmetric functions, the quasi-symmetry test returns no 
unless an event of probability at most 6' + 6' = 6 occurs. 

Finally, the number of queries made is at most 

We now have the following theorem. 
Theorem 4-3: There is a test for quasi-symmetry that makes at most 0(^{n/e) \og{n/5)) queries. 

We observe that when the quasi-symmetry test returns no, a witness to the non-quasi-symmetry of / 
can be obtained from the witnesses provided by the dependency estimate and the final performance of the 
symmetry test. 
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